Overview

Dataset Statistics

Number of Variables 13
Number of Rows 299
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 30.5 KB
Average Row Size in Memory 104.4 B
Variable Types
  • Numerical: 7
  • Categorical: 6

Dataset Insights

age is skewed Skewed
creatinine_phosphokinase is skewed Skewed
ejection_fraction is skewed Skewed
platelets is skewed Skewed
serum_creatinine is skewed Skewed
serum_sodium is skewed Skewed
anaemia has constant length 1 Constant Length
diabetes has constant length 1 Constant Length
high_blood_pressure has constant length 1 Constant Length
sex has constant length 1 Constant Length
smoking has constant length 1 Constant Length
DEATH_EVENT has constant length 1 Constant Length
  • 1
  • 2

Variables


age

numerical

Approximate Distinct Count 47
Approximate Unique (%) 15.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4.7 KB
Mean 60.8339
Minimum 40
Maximum 95
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • age is skewed right (γ1 = 0.4209)

Quantile Statistics

Minimum 40
5-th Percentile 42.9
Q1 51
Median 60
Q3 70
95-th Percentile 82
Maximum 95
Range 55
IQR 19

Descriptive Statistics

Mean 60.8339
Standard Deviation 11.8948
Variance 141.4865
Sum 18189.334
Skewness 0.4209
Kurtosis -0.2018
Coefficient of Variation 0.1955
  • age is not normally distributed (p-value 1.4162537711951494e-07)

anaemia

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Memory Size 19.3 KB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 299
  • The top 2 categories (0, 1) take over 50.0%
  • anaemia has words of constant length

creatinine_phosphokinase

numerical

Approximate Distinct Count 208
Approximate Unique (%) 69.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4.7 KB
Mean 581.8395
Minimum 23
Maximum 7861
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • creatinine_phosphokinase is skewed right (γ1 = 4.4407)

Quantile Statistics

Minimum 23
5-th Percentile 59
Q1 116.5
Median 250
Q3 582
95-th Percentile 2263
Maximum 7861
Range 7838
IQR 465.5

Descriptive Statistics

Mean 581.8395
Standard Deviation 970.2879
Variance 941458.5715
Sum 173970
Skewness 4.4407
Kurtosis 24.7105
Coefficient of Variation 1.6676
  • creatinine_phosphokinase is not normally distributed (p-value 8.289860482271499e-19)
  • creatinine_phosphokinase has 29 outliers

diabetes

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Memory Size 19.3 KB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 299
  • The top 2 categories (0, 1) take over 50.0%
  • diabetes has words of constant length

ejection_fraction

numerical

Approximate Distinct Count 17
Approximate Unique (%) 5.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4.7 KB
Mean 38.0836
Minimum 14
Maximum 80
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • ejection_fraction is skewed right (γ1 = 0.5526)

Quantile Statistics

Minimum 14
5-th Percentile 20
Q1 30
Median 38
Q3 45
95-th Percentile 60
Maximum 80
Range 66
IQR 15

Descriptive Statistics

Mean 38.0836
Standard Deviation 11.8348
Variance 140.0635
Sum 11387
Skewness 0.5526
Kurtosis 0.02072
Coefficient of Variation 0.3108
  • ejection_fraction is not normally distributed (p-value 3.290116169438477e-07)
  • ejection_fraction has 2 outliers

high_blood_pressure

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Memory Size 19.3 KB
  • The largest value (0) is over 1.85 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 299
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 1.85 times larger than the second largest value (1)
  • high_blood_pressure has words of constant length

platelets

numerical

Approximate Distinct Count 176
Approximate Unique (%) 58.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4.7 KB
Mean 263358.0293
Minimum 25100
Maximum 850000
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • platelets is skewed right (γ1 = 1.455)

Quantile Statistics

Minimum 25100
5-th Percentile 131800
Q1 212500
Median 262000
Q3 303500
95-th Percentile 422500
Maximum 850000
Range 824900
IQR 91000

Descriptive Statistics

Mean 263358.0293
Standard Deviation 97804.2369
Variance 9.5657e+09
Sum 7.8744e+07
Skewness 1.455
Kurtosis 6.0859
Coefficient of Variation 0.3714
  • platelets is not normally distributed (p-value 2.9307190381725105e-09)
  • platelets has 21 outliers

serum_creatinine

numerical

Approximate Distinct Count 40
Approximate Unique (%) 13.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4.7 KB
Mean 1.3939
Minimum 0.5
Maximum 9.4
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • serum_creatinine is skewed right (γ1 = 4.4336)

Quantile Statistics

Minimum 0.5
5-th Percentile 0.7
Q1 0.9
Median 1.1
Q3 1.4
95-th Percentile 3
Maximum 9.4
Range 8.9
IQR 0.5

Descriptive Statistics

Mean 1.3939
Standard Deviation 1.0345
Variance 1.0702
Sum 416.77
Skewness 4.4336
Kurtosis 25.3783
Coefficient of Variation 0.7422
  • serum_creatinine is not normally distributed (p-value 6.423463194426497e-15)
  • serum_creatinine has 29 outliers

serum_sodium

numerical

Approximate Distinct Count 27
Approximate Unique (%) 9.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4.7 KB
Mean 136.6254
Minimum 113
Maximum 148
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • serum_sodium is skewed left (γ1 = -1.0429)

Quantile Statistics

Minimum 113
5-th Percentile 130
Q1 134
Median 137
Q3 140
95-th Percentile 144
Maximum 148
Range 35
IQR 6

Descriptive Statistics

Mean 136.6254
Standard Deviation 4.4125
Variance 19.47
Sum 40851
Skewness -1.0429
Kurtosis 4.0311
Coefficient of Variation 0.0323
  • serum_sodium is not normally distributed (p-value 2.3930694674412726e-07)
  • serum_sodium has 4 outliers

sex

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Memory Size 19.3 KB
  • The largest value (1) is over 1.85 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 299
  • The top 2 categories (1, 0) take over 50.0%
  • The largest value (1) is over 1.85 times larger than the second largest value (0)
  • sex has words of constant length

smoking

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Memory Size 19.3 KB
  • The largest value (0) is over 2.11 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 1
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 299
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 2.11 times larger than the second largest value (1)
  • smoking has words of constant length

time

numerical

Approximate Distinct Count 148
Approximate Unique (%) 49.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4.7 KB
Mean 130.2609
Minimum 4
Maximum 285
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • time is skewed right (γ1 = 0.1272)

Quantile Statistics

Minimum 4
5-th Percentile 12.9
Q1 73
Median 115
Q3 203
95-th Percentile 250
Maximum 285
Range 281
IQR 130

Descriptive Statistics

Mean 130.2609
Standard Deviation 77.6142
Variance 6023.9653
Sum 38948
Skewness 0.1272
Kurtosis -1.2119
Coefficient of Variation 0.5958

DEATH_EVENT

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Memory Size 19.3 KB
  • The largest value (0) is over 2.11 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 299
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 2.11 times larger than the second largest value (1)
  • DEATH_EVENT has words of constant length

Interactions

Correlations

Missing Values